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Abstract — In this work, we are concerned with maximizing the 
lifetime of a cluster of sensors engaged in single-hop communica- 
tion with a base-station. In a data-gathering network, the spatio- 
temporal correlation in sensor data induces data-redundancy. 
Also, the interaction between two communicating parties is well- 
known to reduce the communication complexity. This paper 
proposes a formalism that exploits these two opportunities to 
reduce the number of bits transmitted by a sensor node in a 
cluster, hence enhancing its lifetime. We argue that our approach 
has several inherent advantages in scenarios where the sensor 
nodes are acutely energy and computing-power constrained, but 
the base-station is not so. This provides us an opportunity to 
develop communication protocols, where most of the computing 
and communication is done by the base-station. 

The proposed framework casts the sensor nodes and base- 
station communication problem as the problem of multiple 
informants with correlated information communicating with a 
recipient and attempts to extend extant work on interactive 
communication between an informant-recipient pair to such 
scenarios. Our work makes four major contributions. Firstly, 
we explicitly show that in such scenarios interaction can help in 
reducing the communication complexity. Secondly, we show that 
the order in which the informants communicate with the recipient 
may determine the communication complexity. Thirdly, we pro- 
vide the framework to compute the m-message communication 
complexity in such scenarios. Lastly, we prove that in a typical 
sensor network scenario, the proposed formalism significantly 
reduces the communication and computational complexities. 

I. Introduction 

A typical wireless sensor network consists of sensor nodes 
with limited energy reserves. Many sensor network applica- 
tions expect the sensor nodes to be active for months and 
may be years. However, in most of the situations, once the 
sensor nodes run out of their energy reserves, then replacing 
their batteries is not possible either due to the inaccessibility 
of sensor nodes or because such an endeavor may not be 
economically viable. So, there is a great demand and scope of 
the strategies which attempt to reduce the energy consumption, 
hence increase the lifetime of the sensor nodes. 

Sensor nodes spend energy in receiving and transmitting 
data, sensing/actuating, and computation. In this paper, we 
concern ourselves with reducing the energy cost of trans- 
mission. The energy cost of receiving data can be easily 
incorporated in the model that we propose. The energy spent 
in sensing/actuating represents a fixed cost that can be ignored. 
We assume computation costs are negligible compared to radio 
communication costs. Though this is a debatable assumption 



in dense networks, but incorporating computation costs is not 
straightforward and is left for future work. 

The transmission energy depends on three factors: the 
number of bits to be transmitted, the path-loss factor between 
the sensor nodes and the base-station, and the time available 
to transmit the given number of bits. The path-loss factor 
describes the wireless channel between a sensor node and 
the base-station and captures various channel effects, such as 
distance induced attenuation, shadowing, and multipath fading. 
For simplicity, we assume the path-loss factor to be constant. 
This is reasonable for static networks and also the scenarios 
where the path-loss factor varies slowly and can be accurately 
tracked. The idea of varying the transmission time to reduce 
the energy consumption was proposed in [1] and explored 
in [2] in the context of sensor networks, where its MV- 
hardness is discussed. In this paper, we attempt to reduce the 
transmission energy by reducing the number of bits transmitted 
by the sensor nodes to the base-station. 

In a data gathering sensor network, the spatio-temporal 
correlations in sensor data induce data-redundancy. In [3], 
Slepian and Wolf show that it is possible to compress a set 
of correlated sources down to their joint entropy, without 
explicit communication between the sources. This surprising 
existential result shows that it is enough for the sources to 
know the probability distribution of data generatecQ. Recent 
advances [4] in distributed source coding allow us to take 
advantage of data correlation to reduce the number of bits that 
need to be transmitted, with concomitant savings in energy. 
However, finding the optimal rate allocation lying in Slepian- 
Wolf achievable rate region defined by 2 N — 1 constraints for 
N nodes and designing efficient distributed source codes is a 
challenging problem. We simplify this problem by allowing 
the interaction between the base-station and the sensor nodes, 
and introducing the notion of instantaneous decoding [2]. This 
reduces the rate allocation problem to the problem of finding 
the optimal scheduling order, albeit at some loss of optimality. 

Two results in the theory of interactive communication 
provided further motivation for our work. First result states that 
even if the feedback does not help in increasing the capacity 
of a communication channel, it does help in reducing the com- 
plexity of communication [5], [6]. Second result states that for 
the worst-case interactive communication, for almost all sparse 

'Actually, the knowledge of conditional entropies suffices. 



pairs the recipient, who has nothing to say, must transmit 
almost all the bits in an optimal interactive communication 
protocol and informant transmits almost nothing [7]. 

Based on these results, this paper proposes a formalism that 
attempts to reduce the number of bits sent by a sensor node. 
The proposed formalism casts the problem of many sensor 
nodes communicating with a base-station as a problem where 
multiple informants with correlated information communicate 
with single recipient. Here we identify the base-station as 
the recipient of the information and the sensor nodes as 
the sources of information. However, [7] and subsequent 
papers [8] — [ 11] consider only the interactive communication 
between a single informant-recipient pair, while in the sensor 
networks, we have as many informant-recipient pairs, as the 
number of sensor nodes. If the sensor data is assumed to be 
uncorrelated, then the results of [7] can be trivially extended 
to the present scenario. However, in a data-gathering sensor 
network, the sensor data is supposed to be correlated. So, 
extending the results in [7], [9], [11] to the scenarios where 
multiple informants with correlated information communicate 
with single recipient and then applying those results to the 
sensor networks, is not straightforward. 

Formally, the problem is the following. There are N sources 
of correlated information and there is one recipient of infor- 
mation V, which needs to collect the information from these 
N sources. Assume that V can interact with any of those N 
sources, but among themselves these N sources cannot interact 
directly. At the end of communication, each of these N sources 
need not know what other N—l sources know. We are looking 
for the most efficient communication schemes (ones which 
minimize the communication complexity) for this problem. In 
the context of sensor networks, if the base-station knows the 
joint distribution of the sensor data or the correlations therein, 
then it can play some distributed version of the 'game of 
twenty questions' with N sensors to retrieve their information. 
If there is a single sensor node or if the sensor data are 
uncorrelated, then the base-station can 'play' with individual 
sensor nodes and any node i needs to send at least Hi(X) bits, 
where Hi(X) is the entropy of the information source at node 
i. So, assuming the sensor data at all the nodes are identically 
distributed, finally the base-station retrieves NH(X) bits from 
the N sensors. However, in a more realistic situation, the 
sensors have correlated data and the base-station needs to 
retrieve only H(X\, ...X N ) < NH{X) bits. Note that we can 
talk in terms of entropies only when we are concerned with 
average number of bits communicated. However, if we are 
interested in the worst-case number of bits, then it depends 
solely on the cardinality of the support set of the data of 
individual nodes than on the corresponding probabilities. 

In this work, we make two simplifying assumptions: the 
sensor nodes communicate with the base-station in a single- 
hop and only the total number of bits exchanged between the 
sensor nodes and the base-station are considered and this is 
what we refer to as 'communication complexity'. This paper 
shows that the communication complexity depends on the 
model of the spatial-correlation of the sensor data as well as 



on the order in which the sensor nodes communicate with 
the base-station. Under the assumption of an omniscient base- 
station, the base-station can compute the optimal number of 
bits, which any node in any schedule needs to send to it for 
any spatial correlation model of the sensor data. However, a 
sensor node, even if it knows in how many bits it needs to 
send its information to the base-station, may not actually be 
able to compress its information to that many number of bits, 
without explicit knowledge of how its data is correlated with 
the data of the other nodes. In general, this is neither possible 
given the limited knowledge of the network that a sensor node 
is supposed to have, nor desirable given the limited energy and 
computational capabilities of the sensor nodes. An omnipotent 
base-station can take up the most of the burden of computation 
and communication, allowing the the sensor nodes to perform 
minimal computation and transmit minimal number of bits, 
hence conserving their precious energy reserves and possibly 
increasing their operational lifetime. 

In the section HU we give an example of the situation where 
multiple informants with correlated information communicate 
with single recipient. We use this example to illustrate the 
complexity of communication in such scenarios and various 
other fundamental issues and our results. Section Hill develops 
the formalism to compute the worst-case communication com- 
plexity in the scenarios where multiple correlated informants 
communicate with a single recipient. Section [IV] illustrates 
some of the ideas proposed and developed in this paper in 
the context of a sensor network with one particular model of 
spatial correlation of sensor data. Finally, the section [V] lists 
the contributions of our work and concludes the paper. 

II. An Example 

There are N groups {Si,l < i < N}, and each group 
has t teams. In every match, two teams from two different 
groups play against each other. The result of each match is 
announced over radio. Matches always result in a clear winner. 
The format of the radio announcements is: "Today the teams 
from groups / and J played against each other. The match 
between teams i and j was won by team k." Three persons 
Px, Py, and Pz are involved. Px listens to the first part of the 
announcement "Today the teams from groups / and J played 
against each other." and then the radio is snatched by person 
Py, who listens to the portion "The match between teams i 
and j was won by" before the radio is snatched from him by 
Pz, who listens to the portion "team k". Now all three persons 
agree that Px must know which two teams played and who 
the winner was (Py and Pz need not learn which two teams 
from which two groups actually played and which one finally 
won). They want to find the most efficient way to do this. 

Px wants to know which two teams from the groups / 
and J played and which one actually won. Suppose only Py 
communicates with Px, then Px will only know which two 
teams from the groups / and J actually played the match, but 
not the winner. On the other hand, if only Pz communicates 
with Px, then Px knows the teams from which two groups 
played and who was the winner, but he may not know who 



the other team was. In this problem, it is essential that the 
identity of the group to which a team belongs to is included 
in the name of the team, that is, the names of the teams have 
to be globally unique. Suppose on the contrary, that the names 
of the teams are only unique within a group, then two or more 
groups might have the the teams with the same name. So, in 
the event of a match between two teams from two different 
groups with the same name playing against each other, Px will 
not be able to make the winner out of the information sent to 
him by Pg, even if Py has already informed Px of which 
two teams have played. So, we are demanding that once Py 
and Pz have communicated all their information to Px, then 
Px must be able to unambiguously infer which two teams had 
played as well as who the winner was. The previous argument 
proves that this demand is satisfied if and only if the team 
names are globally unique. 

It should be noted that the total number of bits exchanged 
to complete the entire process of communication depends very 
much on the format of the announcement. For example, in the 
original problem in [7], if the format of the announcement 
is: "the match between teams i and j is won by first/second 
team" with the protocol that the 'first' ('second') corresponds 
to the first (second) team mentioned in the announcement. In 
such a situation, only one bit needs to be sent to Py, with or 
without interaction between Px and Py. 

A. Without interaction 

Py sends [log A] + [log t \ bits to identify one of the groups 
and the team from it. After this message, it does not need 
to identify the second group as it is obvious to Px- So, Py 
needs to send another [logt] bits to enable Px know which 
is the team from the other group (message 1, 2). Pz sends 
[log A + log t] bits to Px to help him know who the winner 
was (message 3). 

Px sends : nothing (1) 

Py sends : flog iV] + [logf| + [logf| 

= [log TV] +2[log*l 

Pz sends : [log A + log t] 

B. With interaction 

There are two scenarios here. One where Py communicates 
its information to Px before Pz communicates with Px- The 
other scenario is the one where Pz sends his information to 
Px before Py sends. We have to compute the number of bits 
exchanged for both the scenarios. 

When Py communicates with Px before Pg: Px knows 
the two groups from which two teams played against each 
other. Px encodes in [log TV] bits the names of the groups. 
So, Px sends in [log log N] bits the first bit location at which 
the encodings of the two groups differ to Py (message 1). 
Py on its turn, sends the value of the first bit at which the 
encodings of two groups differ along with the [logt] bits to 
identify the team within one of the groups. To help identify the 
team from the other group it just needs to send [logi] bits to 
Px as the identity of the group to which this team belongs to 



is already known to Px by now (message 2, 3, 4). At the end 
of this step, Px knows which two teams had played. So, now 
in [log(log N + \ogt)~\ bits it sends the identity of the first bit 
location at which the encodings of the two teams differ to Pz 
(message 5) and Pz responds by sending the value of that bit 
(message 6). With this Px can determine who the winner was. 
So under this scenario, the number of bits exchanged are: 

P x sends : [log log A] + [log(log A + log t)] (2) 
Py sends : 1 + [logt] + [logt] = 1 + 2\\ogf] 
Pz sends : 1 

When Pz communicates with Px before Py: Px sends 
in [log log A] bits the location of the first bit at which the 
encodings of the two groups differ to Pz (message 1). Pz 
in its turn, sends the value of the bit at that location along 
with the encoding of the winning team in [logi] number of 
bits (message 2, 3). At the end of this step, Px knows which 
was the winning team and to which group it belonged to too. 
So, all that Px does not know now is that which team from 
the other group also played in the match. It sends to Py the 
location and value of the first bit at which the encodings of two 
two teams differ in 1 + [log log A] bits (message 4, 5). Py 
then responds by sending [log t] number of bits to identify 
the team from the given group (message 6). So under this 
scenario, the number of bits exchanged are: 

Pa- sends : [log log A] + 1 + [log log A] (3) 

= l + 2[loglogA] 

Py sends : [logt] 

P z sends : 1 + [log t] 

Note fl}, ©, and ® show that if we consider, total 
number of bits exchanged in the entire communication or 
the total number of bits transmitted by persons Py and Pz 
together, interaction helps in reducing the number of bits 
compared to when no interaction is allowed. Further, when Pz 
communicates with Px before Py, the number of exchanged 
bits are less than when Py communicates with Px before Pz- 

If we adopt the convention that all the messages sent by a 
source, until some other source sends the messages, form one 
message, then in all the above situations at most four messages 
are exchanged. So, in a communication protocol following this 
convention, the source concatenates all the messages that it 
sends before some other source begins to send the messages, 
and receiver knows how to parse the concatenated message 
into its individual messages. 

As [7], [8], [11] and the example above prove, the inter- 
action reduces the number of bits exchanged between the 
informants and the recipient. However, from the above exam- 
ple, it also becomes clear that even for the given number of 
messages exchanged between the informants and the recipient, 
in general, the number of bits exchanged depends on the order 
in which the informants communicate with the recipient. So in 
the above example, with four messages allowed, the number of 
bits exchanged between Px,Py, and Pz depend on whether 



Py communicates with Px before Pz or after. We conjecture 
that this is so due to "somewhat" asymmetric nature of the 
distribution of the information at the nodes Py and Pz of this 
particular example and loosely, we can say that the messages 
from Pz contain more information than those from Py. 

III. Worst case interactive communication: 

MULTIPLE INFORMANTS SCENARIO 

In this section we attempt to develop a theory of worst-case 
interactive communication. To keep the discussion simple and 
clear, we consider a general scenario involving two informants 
(Py and Pz) and one recipient (Px)- However, the same 
formalism can be extended for the scenarios involving more 
than two informants. For the sake of completeness and to 
facilitate the discussion that follows, let us reintroduce the 
notion of ambiguity set and some related concepts, defined 
originally in [7]. 

Let (X,Y) be a random pair, with support set S X: y. The 
support set of X is the set 



S, 



def 



{x : for some y, (x, y) e S x ,y} 



of possible X values. Sy, the support set of Y, is similarly 
defined. Py's ambiguity set when his random variable Y takes 
the value y € Sy is 



Sx\y(v) d = f {x 



(x,y) G S x ,y}, 



(4) 



y. His ambiguity in 



the set of possible X values when Y 
that case is 

1*x\y{v) = \S X \y{y)\, 

the number of possible X values when Y = y. The maximum 
ambiguity of (X, Y) is 



def 

jj X \Y = sup{fi X \Y(y) ■ y e S Y }, 



(5) 



the maximum number of X values possible with any given Y 
value. 

Assume that a total of 2m messages are allowed, with m 
messages per informant allowed to be exchanged between the 
informant and the recipient. There are two possible schedules 
in which the informants can communicate with the recipient: 
either Py communicates first with Px or Pz communicates 
first. In the spirit of [7], let us introduce some more definitions: 

@2m (y> Z/X): 2m-message worst-case complexity of 
transmitting Y, Z to a person who already knows X, when 
Y communicates first. 

C"2m Z / X): 2m-message worst-case complexity of 
transmitting Y, Z to a person who already knows X, when 
Z communicates first. 

Using these definitions, we have: 



(Y, Z/X) = C m (Y/X) + C m (Z/X, Y) 



CfJ(Y, Z/X) = C m (Z/X) + C m (Y/X, Z) 



(6) 
(7) 



Proof: We omit the detailed proof for the sake of brevity. 
However, the league example of the previous section provides 
an example supporting the statement of the theorem. ■ 

Corollary: The unbounded interaction complexities satisfy 
C Y J(Y,Z/X) + C Z J(Y,Z/X). 

It is easy to prove the following trivial, but quite useful 
lower bounds on the unbounded interaction complexities. 

Lemma 1: For all (X, Y, Z) tuples, 



Coo(Y/X) 
C X (Z/X, Y) 
Cl z (Y,Z/X) 



> 
> 
> 



riog/V|xi, 

riogAz|x,y~|, 
|~ lo g /Veri- 



similar bounds exists for Coo (Z/X), Coo(Y/X,Z), and 
Cg Y (Y,Z/X). 

Since empty messages are allowed, it is obvious that 
C m (Y/X) is a decreasing function of m. This holds true 
for other complexities, such as C m (Z/X) etc too. This fact 
together with previous lemma implies that 



C m (Y/X) 
C m (Z/X, Y) 
ClZ(Y,Z/X) 



> 
> 
> 



[logp, Y \x], 
ri°gAz|x,y~|, 

With similar bounds for C m (Z/X), C m (Y/X,Z), and 
CiZ(Y,Z/X). 

We can use above results to find the communication com- 
plexity of the version of the league problem discussed in the 
previous section and arrive at the results of (O and (0. If we 
identify the sets X, Y, and Z appropriately, we can directly 
use the results from [7]. For example, the relevant support sets 
Sy,x, Sz;X,y> &z,x, an d S Y; x,z, with 1 < i ^ j < N are: 

S Y ,x = {({k,l),(S l ,S 1 )),keS l ,leS 1 } 

Szx,y = {(k,(m,n)),m G Si,n e Sj,k G {m,n}} 

Szx = {{k,(S l ,S 1 )),keS l US ] } 

Syx.z = {(k, Si), Si is not the winner's group, k E Si} 

IV. Application to Sensor Network 

In this section, we apply the formalism developed in the 
previous sections to illustrate the computation of worst-case 
and average-case interactive communication complexities. We 
assume following spatial correlation model for the sensor data. 

Let Xi be the random variable representing the sampled 
sensor reading at node i e {1, . . . , N} and B(Xi) denote the 
number of bits that the node i has to send to the base-station. 
Let us assume that each node i has at most n number of 
bits to send to the base-station, so B(Xi) = n. However, due 
to the spatial correlation among sensor readings, each sensor 
may send less than n number of bits. Let us define a data- 
correlation model as follows. 

Let dij denote the distance between nodes i and j. Let us 
define B(Xi/Xj), the number of bits that the node i has to 
send when the node j has already sent its bits to the base- 
station, as follows: 



Theorem 1: In general, C 2 „f (Y, Z/X) ± C£ (Y, Z/X) 



B(Xi/Xj) 



n if d 



if dij < n 

> n 



(8) 
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Fig. 1. Data Correlation Model for n = 5: plot of B(Xi/Xj) versus dij 

Figure Q] illustrates this for n = 5. It should be noted that 
when \dij~\ < n, the data of the nodes i and j differ in at 
most B(Xi/Xj) least significant bits. So, the node i has to 
send, at most, [dy] least significant bits of its n bit data. 

From the definition above in ((HJ, follows the symmetry of 
the conditional number of bits: 

B{X i /X j )=B{X j /X i ) (9) 

However, the definition of the correlation model is not 
complete yet and we must give the expression for the number 
of bits transmitted by a node conditioned on more than one 
node already having transmitted their bits to the base-station. 
There are several ways in which this quantity can be defined, 
we choose the following definition: 

B(Xi/X 1: AVi) = min B{X l /X J ) (10) 
i<j<% 

So, with this definition, the number of bits transmitted by 
any node in a schedule depends only on the node nearest to 
it among all the nodes already polled in the schedule. 

A. Worst-case communication 

For a given schedule n, first node 7r(l) transmits its n bits 
to the base-station. The omniscient base-station based on its 
knowledge of the correlation model in dHJ as well as all the 
internode distances, knows that its maximum ambiguity about 
second node's data is 2 S(X "< 2 > /X "< 1 > ) , as 2 B(X "( 2 ) /X -( 1 ) ) bit- 
patterns are possible for the B(X 7r ^2)/X 7r t 1 \) least-significant 
bits of 7r(2)'s data. So, even if we allow unbounded interaction 
between the base-station and the node 7r(2), it follows from the 
results of the previous section that at least i?(X 7r ( 2 )/^ 7 r(i)) 
bits are exchanged, even by the optimal communication proto- 
col. However, here we are more concerned with demonstrating 
the reduction in the communication complexity due to the 
interaction between the sensor nodes and the base-station 
rather than with proposing the schemes which achieve the 
optimal lower bounds of the communication complexity. So, 



here we propose an almost optimal protocol for the commu- 
nication between the base-station and a sensor node. Base- 
station informs the node 7r(2) in [log-E^X^^/X^i))] bits 
(if B(X 7T ^ 2 )/^tt(i)) — 1> men the base-station sends one 
bit to the corresponding node) to transmit i?(X 7I .( 2 )/X w ( 1 )) 
least significant bits of its information and the sensor node 
7r(2) responds by sending corresponding bits. Note that if no 
interaction is allowed between the base-station and the sensor 
node, then the sensor node has to send n bits to the base- 
station. Continuing this process, the base-station queries all 
the N nodes. So, the worst-case communication complexity 
CV of schedule tt is the sum of the total number of bits sent 
by the base-station and the number of bits sent by the sensor- 
nodes, in the worst-case. It is given by: 

CV = [logB(X„ {2) /X n{1) )]+... 

+ \log B(X W ( N )/Xn(i), . . . ,^ ff (Ar-i))l 
+B(X 7T (2)/Xi T ( 1 )) + . . . 

+i?(X 7r ( Ar) /X Tr ( 1 ), . . . ,X 7r(A r_ 1 )) (11) 

Let n be the set of all possible schedules. Then optimally 
minimum value of is achieved for that schedule i £ II 
that solves the following optimization problem: 

Cmin = min CV (12) 
?ren 

Given the definition of the correlation model in dD and 
([Tol l, it is easy to see that the optimum schedule is generated 
by a greedy scheme that chooses the next node in the schedule 
(from the set of the nodes not already scheduled) to be that 
node that is nearest to the set of already scheduled nodes. 

B. Average-case communication 

As noted above in the discussion of the correlation model 
of dHJ, the data of two correlated nodes i and j can differ in 
at most B(Xi/Xj) number of least significant bits. However, 
it is not necessary that it actually differs in those many bits. 
So, there can be the situations where even if the base-station 
has estimated that the node i has to send B(Xi/Xj) number 
of its least-significant bits and communicated this to the node 
i, the node i's data differs from the the data of node j in 
less than B(Xi/Xj) least-significant bits. In such situations, 
it is sufficient for the base-station to reconstruct the data of 
the node i if the node i sends only those bits where its data 
actually differs from that of node j. 

Given the B(Xi/Xj) number of its least-significant bits that 
the node i has to send to the base-station, there are 2 B ( Xi / x ^ 
possible bit-patterns, one out of which the node i has to 
communicate to the base-station. Assuming that the each of 
these 2 B ^ Xi l x ^ bit-patterns are uniformly distributed with 
probability l/2 B ( Xi / Xj \ then following the typical Huffman- 
coding argument, the node i can communicate its data to the 
base-station in at most B(Xi/Xj) bits on average. 

Given a communication schedule tt, the first node in 
the schedule tt(1) sends its n bits to the base-station. 
Based on this and the knowledge of the correlation model 
given in ©, the base-station informs the second node 7r(2) 



in [logB(X 7r / 2 )/-X' 7r n))] bits to send its information in 
B(X 7r /2\/X n t 1 \) bits. Assuming the uniform-distribution, the 
node 7r(2) sends the requested information in B(X n (2) / 'X^tW) 
bits, in average. Continuing this process, the base-station 
queries all the N nodes. So, the average-case communication 
complexity C v of schedule tt is the sum of the total number 
of bits sent by the base-station and the number of bits sent, 
on average, by the sensor-nodes. It is given by: 

C% = riog J B(X 7r(2) /X 7r(1) )]+... 

+ [log-B(X 7I . (A T)/X 7r ( 1 ), . . . ,X^(jv-i))l 

+-B(X 7r ( 2 )/X 7r ( 1 )) + . . . 

+-B(X 7r (jv)/X 7r ( 1 ),...,J£ w (jv-i)) (13) 

The optimal value of CV is achieved for that schedule tt 6 IT 
that solves the following optimization problem: 

<?mm= min CV (14) 
?ren 

It is easy to see that, once again, the optimum schedule is 
generated by a greedy scheme that chooses the next node in 
the schedule to be that node that is nearest to the set of the 
nodes which are already scheduled. 

Remark: Comparing (fTTT i and (fT3l . it may appear that the 
average-case performance of a schedule is no better than 
its worst-case performance, but it should be noted that the 
average-case analysis is done for the uniform distribution of 
the bit-patterns, that gives the maximum entropy. For any other 
distribution of the bit-patterns, the average communication 
complexity will be lesser than CV given by 1131 . 

V. Conclusions and Future Work 

This paper proposes a new framework, based on exploiting 
the redundancy induced by the spatio-temporal correlations 
in the sensor data and the reduction in communication com- 
plexity due to interaction, to reduce the total number of 
bits sent by the sensor nodes in a single-hop data-gathering 
sensor network. The proposed formalism views the problem of 
many sensor nodes communicating with the base-station as the 
problem of many informants with the correlated information 
communicating with single recipient. We extend various exist- 
ing results on single informant-recipient pair communication 
to the present case. We show that such extensions lead to 
various non-trivial new results. Finally, we apply this new 
framework to compute the worst-case and average-case com- 
munication complexities of a typical sensor network scenario 
and demonstrate the significance of our contribution. 



Major contributions of our work are the following: 

1) We show that interaction helps in reducing the commu- 
nication complexity also in the scenarios where more 
than one informant are involved. 

2) We show that when the ambiguity is more than two, then 
no fixed number of messages are optimum. 

3) We show that for the scenarios involving more than one 
informant, the order in which the informants commu- 
nicate with the recipient may determine the commu- 
nication complexity. We conjecture that if the nodes 
which have 'more' information communicate first, then 
this brings down the overall communication complexity. 
Essentially, we need a metric to quantify the 'amount of 
information', but not in 'Shannon-sense'. 

4) We show that when multiple informants communicate 
with a recipient, the m-message complexity of com- 
munication between informant Y and recipient X, can 
be computed by directly modifying the hypergraph 
G(Y\X) based on the information provided by the 
previous informants in the communication schedule. 

In future, we would like to give more formal and detailed 
exposition of our work presented in this paper. 
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